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Abstract —We consider an energy harvesting channel, in vrhich 
the transmitter is powered by an exogenous stochastic energy 
harvesting process Et, such that 0 < Et < E, which can be 
stored in a battery of finite size B. We provide a simple and 
insightful formula for the approximate capacity of this channel 
with bounded guarantee on the approximation gap independent 
of system parameters. This approximate characterization of the 
capacity identifies two qualitatively different operating regimes 
for this channel: in the large battery regime, when B > E, the 
capacity is approximately equal to that of an AWGN channel 
with an average power constraint equal to the average energy 
harvesting rate, i.e. it depends only on the mean of Et and is 
(almost) independent of the distribution of Et and the exact value 
of B. In particular, this suggests that a battery size B ^ E \s 
approximately sufficient to extract the infinite battery capacity 
of the system. In the small battery regime, when B < E, we 
clarify the dependence of the capacity on the distribution of Et 
and the value of B. 

There are three steps to proving this result which can be of 
interest in their own right: 1) we characterize the capacity of this 
channel as an n-letter mutual Information rate under various 
assumptions on the availability of energy arrival information: 
causal and noncausal knowledge of the energy arrivals at the 
transmitter with and without knowledge at the receiver; 2) we 
characterize the approximately optimal online power control 
policy that maximizes the long-term average throughput of the 
system; 3) we show that the information-theoretic capacity of this 
channel is equal, within a constant gap, to its long-term average 
throughput. This last result provides a connection between the 
information- and communication-theoretic formulations of the 
energy-harvesting communication problem that have been so far 
studied in isolation. 


I. Introduction 

Energy-harvesting is quickly becoming a game-changing 
technology for many wireless systems. The promise of self- 
sustained perpetual operation opens exciting possibilities for 
a wide range of applications from powering base stations in 
rural areas with renewable energy sources (ex. wind or sun) 
to building in-body wireless networks powered by body heat, 
motion or RF energy transfer. However, energy harvesting also 
brings a fundamental shift in communication system design 
principles. In conventional systems, energy (or power) is a 
deterministic quantity continuously available to the transmitter 
and communication is typically constrained only in terms 
of average power. In harvesting systems, energy may not 
be generated at all times and the rate of energy generation 
can be unpredictable and fluctuate significantly over time. In 
such systems, energy that becomes available for information 
transmission can be modeled as a stochastic rather than a 
deterministic process. 


Et 



Fig. 1. Energy harvesting AWGN channel model. 


Communication under an average power constraint is well- 
understood. Shannon’s most famous capacity formula 

C = lVlog(^l + ^) bits/s, (1) 

provides the basis for designing efficient communication sys¬ 
tems operating under this constraint. This formula not only 
quantifies exactly the performance limit of communication 
over an additive white gaussian noise (AWGN) channel con¬ 
strained to an average power of P Watts, allocated bandwidth 
W Hz, and subject to white noise of power spectral density 
Nq/2 Watts/Hz, but perhaps even more importantly also 
identifies two fundamentally different operating regimes for 
this channel where the dependence of the capacity on major 
system parameters is qualitatively different. In the power- 
limited (or low-signal-to-noise ratio (SNR)) regime, where 
SNR := P/NqW <C 0 dB, the capacity is approximately 
linear in the power, and the performance depends critically 
on the power available but not so much on the bandwidth. In 
the bandwidth-limited (or high-SNR) regime, where SNR ^ 0 
dB, the capacity is approximately linear in the bandwidth and 
the performance depends critically on the bandwidth but not so 
much on the power. The regime is determined by the interplay 
between the amount of power and bandwidth available. The 
design of good communication schemes for wireless systems 
has been primarily driven by the parameter regime one is in. 

Despite significant recent effort |[T)-Q, there is no anal¬ 
ogous understanding for energy harvesting communication 
systems. Fig. depicts the basic model that captures this 
form of communication. Here a transmitter powered by an 
exogenous stochastic energy arrival process Et equipped with 
a battery of size B is communicating to a receiver over an 
AWGN channel. The available energy for transmission at any 
given time is limited by the amount of energy available in 
the battery Bt, which in turn depends on the previous energy 
arrivals as well as the energy consumed in the earlier time 
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slots; 

< Bu 

Bt = inin{i?(_i — X^_i + Et, B}. (2) 

This leads to a complex power constraint on the transmitter 
which has memory and is input-dependent. The problem is 
further complicated by the fact that in a typical scenario the 
realizations of Et are known (causally) only at the transmitter 
and not at the receiver. Obtaining an insightful capacity 
formula for such a transmitter has proven difficult. As a result, 
even very basic questions concerning the design of energy 
harvesting communication systems remain poorly understood, 
such as: 

• how does the system capacity (at least roughly) depend 
on the energy harvesting profile Et and the battery size 
B? 

• are there different operating regimes where this depen¬ 
dence is qualitatively different (analogous to the two 
operating regimes of the classical AWGN channel)? 

• what are the properties of the process Et (ex. mean, 
variance etc.) that most critically determine capacity? as a 
result, what are more favorable and less favorable energy 
harvesting profiles? 

• given an energy harvesting profile Et, how large should 
we choose the size of the battery B so as to extract most 
of the system capacity? 

Previous work 0. 0 provides an approximate formula for 
the capacity of this channel when energy arrivals are i.i.d. 
Bernoulli, which sheds some light on these questions in this 
specific case. 

A. Our Contribution 

In this paper, we derive a simple and insightful formula for 
the approximate capacity of this channel which holds for any 

1.1. d. process Et- In particular, we show that when Et is an 

1.1. d process, the capacity of this channel can be approximated 

1 

C « - log (l -I- E[min{Bt, B}]) bits/s/Hz (3) 

within 3.85 bits/s/Hz, where we assume that the noise variance 
is normalized to l01]This characterization identifies two fun¬ 
damentally different operating regimes for this channel where 
the dependence of the capacity on Et and B is qualitatively 
different. 

Let Et take values in the interval [0,B]. When B > E, 
0 becomes 

(7 « i log (1 -I- E[Bt]) bits/s/Hz, (4) 

^Different from here we focus on the capacity in bits/s/Hz and assume 
that the noise variance is normalized to 1 in order to highlight the dependence 
of the capacity on the new channel parameters. The normalization of the noise 
power to 1 is without loss of generality since the approximation result is 
independent of the parameters of the problem. 

^The additive capacity approximations we develop in this paper are most 
relevant in the high-SNR regime. Note that this can indeed be the operating 
regime of a low power wireless device if the available power is concentrated 
on a very narrow frequency band. This ultra-narrow band approach is indeed 
the defacto technique for some low power loE devices that, despite being 
low-power, operate at reasonably high SNRs 0. G3- 



(a) B > E 

Fig. 2. pdf of Et in the two battery regimes. 


(b) B < E 


and the capacity is approximately equal to that of an AWGN 
channel with an average power constraint equal to the average 
energy harvesting rate. Note that the right-hand side of Q 
trivially upper bounds the capacity of the energy harvesting 
channel as this would be the capacity if the transmitter were 
only constrained in its average transmission power (or average 
energy per channel use), which can obviously not exceed the 
average rate of the incoming energy. Ozel and Ulukus in ||T| 
show that this upper bound can be achieved when B = oo. 
Our result suggests that this large battery regime kicks in 
much earlier, as soon as the battery size B is large enough 
to accommodate the maximal amount of energy that can be 
harvested over a single channel use. This is surprising given 
that the transmitter is limited by the additional constraint (|^, 
and at finite B this can lead to part of the harvested energy 
being wasted due to an overflow in the battery capacity. While 
there is a very natural way to achieve the AWGN capacity in 
© with B = oo - the transmitter can simply remain silent for 
a duration of time sublinear in the blocklength to accumulate 
sufficient energy in the battery and then use standard i.i.d. 
Gaussian coding - achieving the AWGN capacity at finite B 
is intricate and in particular requires an optimal online power 
control strategy. Note that in this large battery regime, which 
© identifies as the case B > E, the capacity approximation 
depends only on the mean of the energy harvesting process: 
two energy harvesting profiles are equivalent as long as they 
provide the same energy on the average. The approximate 
capacity is also independent of the exact size of B. In 
particular, choosing B « B is almost sufficient to extract 
the infinite battery capacity. More precisely, there is limited 
capacity gain in making B much larger than E. 

When B < E, note that one can equivalently consider the 
distribution of Et to be that in Fig. ^(b)| since every energy 
arrival with value Et > B fully recharges the battery, this 
creates a point mass at B with value P{Et > B). In this 
case, © reveals that the capacity is approximately given by 
the mean of this modified distribution. This can be interpreted 
as the small battery regime of the channel. In particular, in 
this regime the capacity roughly depends both on the shape of 
the distribution of Et and the value of B. For example, while 
two energy harvesting mechanisms providing the same average 
energy lead to the same approximate capacity in the large 
battery regime, in the small battery regime they are likely to 
yield different capacity approximations. In particular, constant 
energy arrivals will maximize capacity among all distributions 
with the same mean. Note that while Q suggests that choosing 











3 


B Ki E allows to almost extract the infinite battery capacity 
of the channel, it also quantifies the performance loss when 
B < E. In particular, the performance loss is dictated by the 
difference between the mean of the original distribution for Et 
in Fig. [2]|(a) and that of the modified distribution in Fig. [2|(b) 


Note that when Et is unbounded but has a fast decaying tail, 
choosing B large but finite can be sufficient to approach the 
infinite battery capacity. 


There are three major steps to proving the approximation 
result in ([^ which can be of interest in their own right. We 
first characterize the exact capacity of the energy harvesting 
channel in Fig. as the limit of an n-letter maximum mutual 
information rate under various assumptions on the available 
information regarding the energy harvesting process, such 
as causal or noncausal information at the transmitter with 
or without information at the receiver. Obtaining an n-letter 
mutual information expression for the capacity of this channel 
is nontrivial and has remained an open problem until now, 
since the energy constraints on the transmitter lead to an 
input-dependent random state with memory for the system. In 
particular, it is not a priori clear if the channel is information- 
stable or not. Earlier characterizations of the capacity ||^ have 
been only available in terms of the Verdu-Han framework mi- 

The second step is to devise an approximately optimal 
online power control policy for the energy harvesting channel 
that maximizes its long-term average throughput. Optimal 
power control for energy harvesting systems has been of 
significant interest over the recent years in the communication 
theory literature 10-10. This formulation simplifies the 
communication problem by assuming that there is an underly¬ 
ing transmission scheme operating at a finer time-scale such 
that allocating power P to this scheme yields an information 
rate r{P) = ^ log(l -I- P) bits/s and focuses on maximiz¬ 
ing the long-term average throughput of the system subject 
to energy availability constraints imposed by the harvesting 
process analogous to (|^. The problem has been studied in two 
different settings. In the offline case, where energy arrivals 
are known ahead of time, the optimal power control policy 
has been characterized in |12|- 0. The optimal strategy in 
this case keeps energy consumption as constant as possible 
over time while ensuring energy is never wasted due to an 
overflow in the battery capacity. The more interesting online 
case remains poorly understood. In this setting, the energy 
arrival process is observed causally at the transmitter and only 
statistical information regarding the future energy arrivals is 
available. In this paper, we develop a simple online power 
control policy based on 0^® and show that it is at most 1.8 
bits/s/Hz to optimality, independent of system parameters. This 
strategy waits until the battery is fully charged to B and then 
allocates the energy B in an exponentially decaying manner 
until the next time instant when the battery is fully recharged. 
This near-optimal solution suggests that the optimal power 
control strategy in the online case is structurally different from 
the offline case, where the optimal policy rather aims to keep 
power allocation as constant as possible across different time- 
slots. 

The third step in proving ([^ is to connect the two different 


formulations of the energy harvesting communication problem 
discussed above: the information-theoretic formulation which 
aims to characterize the fundamental capacity of the channel, 
and the communication-theoretic power control problem which 
aims to maximize the long-term average throughput of the 
channel. We first show that the long-term average throughput 
provides an upper bound on the true information-theoretic 
capacity of the channel. For example, the optimal offline 
throughput provides an upper bound on the information- 
theoretic capacity with noncausal knowledge of the energy 
arrivals at the transmitter and the receiver, while the optimal 
online throughput is an upper bound on the capacity with 
causal information regarding the energy arrivals. Perhaps more 
surprisingly, we also show that given an optimal power control 
policy there is a natural way to construct explicit schemes 
which achieve a rate within a constant gap of the upper bound 
(the corresponding long-term average throughput) for any i.i.d. 
energy harvesting process and any value of B. This allows us 
to conclude that the optimal solutions of these two problems, 
the information-theoretic capacity and the long-term average 
throughput, cannot differ from each other by a constant gap 
(ex. the gap is at most 1.05 bits/s/Hz with receiver side 
information). In particular, using the approximately optimal 
online power control policy we develop in the previous step, 
we obtain the approximation (|^ for the information theoretic 
capacity. To the best of our knowledge, this is the first 
work that establishes an explicit relation between the two 
different formulations of the energy harvesting communication 
problem, so far studied separately in the respective information 
and communication theory literatures. 


B. Related Work 

The information-theoretic capacity of the energy harvesting 
channel in Fig.[T]has been of significant recent interest 0-®. 
In particular, 0^ shows that when B = oo the capacity of the 
energy harvesting channel is the same as that of an AWGN 
channel with average power constraint equal to the average 
energy harvesting rate Follow-up works provide upper 

and lower bounds on the capacity for the more realistic 
case of finite battery and n-letter capacity expressions for 
some special cases. In particular, Q provides an expression 
for the capacity in terms of the Verdu-Han framework; 0 
derives upper and lower bounds on the capacity with i.i.d. 
energy arrivals in terms of limits of n-letter maximum mutual 
information rates; Q considers the special case when there 
is a constant amount of energy arriving at each time slot 
and provides an n-letter expression for the capacity in this 
deterministic case; 0 considers the special case where the 
battery is of unit size and the channel is a noiseless bit 
pipe. Characterizing the capacity, even as an n-letter mutual 
information rate, has remained an open problem for general 
energy harvesting processes. The contribution of the current 
paper with respect to this literature can be regarded as: 1) 
providing n-letter expressions for the capacity under various 
assumptions on the available information at the transmitter 
and the receiver regarding the energy arrivals; 2) provide a 
simple and insightful approximation formula for the capacity 
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of this channel by proving upper and lower bounds that 
differ by a constant gap. Our work is most closely related 
to prior work in 0, § which introduces the constant gap 
approximation approach and provides an approximate formula 
for the capacity of the channel in Fig. when the energy 
arrival process is i.i.d. Bernoulli (without providing an n-letter 
expression for the channel capacity). In a companion paper 
p5] , we focus on the special case with Bernoulli energy 
recharges. For this special case, we are able to provide n-letter 
expressions in simpler form, for which the maximizing input 
distribution can be identified in certain cases, and we exactly 
solve the corresponding online power control problem. This 
leads to tighter approximations of the capacity in this special 
case and allow us to deduce new insights on the usefulness 
of noncausal observations of the energy arrivals and output 
feedback (see also ||26l). 

The study of optimal power control policies for the energy 
harvesting channel precedes the study of its information- 
theoretic capacity. The power control problem is well under¬ 
stood in the offline case GD-Gl)- The online power control 
problem can be cast as a Markov Decision Process and the 
optimal solution can be computed numerically using dynamic 
programming pO)-p^. However, the curse of dimensionality 
inherent in the dynamic programming solution makes this 
approach computationally intensive. More importantly, the 
numerical solution provides little insight into the structure of 
the optimal power control strategy, its dependence on major 
system parameters, and the resultant performance. Several 
works focus on establishing properties for the optimal solution, 
however these properties are either very high-level, ex. (22| 
establishes monotonicity of the optimal policy, llz) shows that 
deterministic policies are sufficient; or still require numerical 
evaluation, ex. derives a system of coupled partial integro- 
differential equations as necessary conditions for optimality, 
which can be solved only numerically. It is easy to observe 
that when the battery size is infinite, a simple power control 
strategy that allocates constant power equal to the mean energy 
arrival rate becomes asymptotically optimal and achieves the 
AWGN capacity for any i.i.d. energy harvesting process. Gg- 
19 study the infinite battery regime in more detail. Finally, 
13 and follow-up work propose heuristics without providing 


any guarantees on optimality. In contrast, in this paper we 
propose an explicit online power control policy and show that 
it is within a constant gap to optimality. Indeed, this policy can 
achieve the AWGN capacity, which is achievable with infinite 
battery size, within a constant gap with finite battery. The 
structure of this approximately optimal policy is completely 
different from the heuristics proposed in the literature, which 
are typically inspired by the offline solution or the infinite 
battery regime. 


C. Organization of the Paper 

The paper is organized as follows. Section [I^ contains 
our system model and Section m contains our main re¬ 
sults. Section IV characterizes the capacity of the energy 
harvesting channel as an n-letter mutual information rate. 
Section |V] considers the online power control problem and 


develops an approximately optimal power control policy. The 
reader interested in the power control problem and not the 
information-theoretic capacity of this channel can read this 
section independently. Finally Section|Vl]develops the connec¬ 
tion between the information-theoretic capacity and the power 
control optimization problem. 

II. System Model 

We begin by introducing the notation used throughout the 
paper. Let uppercase, lowercase, and calligraphic letters denote 
random variables, specific realizations of RVs, and alphabets, 
respectively. For two jointly distributed RVs {X,Y), let Px, 
Px,Y, and Py\x^ respectively denote the marginal of X, the 
joint distribution of {X, Y), and the conditional distribution of 
Y given X. Let E[-] denote expectation. For m < n, Xi^ = 
(Xm, X^+i, X„_1, X„), and X" = Xf. With abuse of 
notation, when the superscript is 2 it is understood as square, 
i.e. Xf = (Xt)^. Additionally, when the length is clear from 
the context, we sometimes denote vectors by boldface letters, 
e.g. X G A". All logarithms are to base 2 (In will denote log 
to base e). 

The energy harvesting channel is an AWGN channel, i.e. 
the output at time t is Yt = Xt + Nt, where Nt ~ A/'(0,1) 
and Xi S M is the input. The transmitter has a battery with 
finite capacity B, and the input symbol energy at each time 
slot is constrained by the available energy in the battery. Let 
Bt represent the available energy in the battery at time t. The 
system energy constraints can be described as 

< Bt, ( 5 ) 

Bt = mm{Bt-i - Xf_j^+Et,B}. ( 6 ) 

Et is the energy arrivals process which we assume to be 
i.i.d. for different t. For simplicity we assume that Et is a 
discrete RV over the finite alphabet £, such that Et > 0 
and Et > 0 with positive probability, implying E[£’t] > 0. 
However, our results also apply when Et is continuous or 
comes from a mixed distribution by considering the continuous 
distribution to be the limit of quantized discrete distributions 
(see footnote 1^1. We assume that Bq = b, where 0 < b < B is 
a hxed quantity known to both the transmitter and the receiver 
before beginning of transmission]^ 

In this work, we investigate three cases; Et is observed 
causally at the transmitter only; Et is observed causally at 
the transmitter as well as the receiver; and Et is observed 
noncausally at the transmitter and the receiver. In any case, 
the transmitter has (at least) causal knowledge of Et, which 
implies also causal knowledge of Bt- While it is natural for 
the transmitter to be aware of its own energy arrival process 
in a causal fashion, the receiver may have side information 
regarding the energy arrival process at the transmitter in certain 
scenarios; for example, when it is itself harvesting energy from 
a correlated process or when the transmitter indeed harvests 
the RF energy dissipated by the receiver. 

^This assumption is made to simplify the exposition. Our results hold even 
if the initial battery state is unknown to both the transmitter and the receiver 
and can be arbitrarily fixed or random. 
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For the first case, we define an (M, n, e) code as a set of 
encoding functions and a decoding function 

: >1 X ^ A”, f = 1,..., n, (7) 

( 8 ) 

where A' = 3^ = K and A4 = M}. To transmit 

message w G Ai at time t — the transmitter sets 

The battery state Bt is a deterministic 
function of therefore also of {w,E*). The func¬ 

tions must satisfy the energy constraint 0: 

{fnw,E^))^ <Btiw,E^). 

The receiver sets W = The probability of error is 

1 ^ 

Pr(fF ^ w \ w was transmitted) < e. 

W — 1 

The rate of an {M,n,s) code is We say R is 

^-achievable if for every <5 > 0 there exist, for all sufficiently 
large n, an {M, n, e) code with rate > R — 6. The 

capacity C is the maximal rate that is e-achievable for all 
0 < e < 1. 

When Et is observed also at the receiver (either causally or 
noncausally), ([8]) is altered to x£^ —)■ Ai. Similarly, 

to account for noncausal observations of Et at the transmitter, 
we change Q to : Ad x A'", so that = 

/®“(w,i7"), where again 

{friw,E-))^<Btiw,E^). 

Note that Bt is again deterministic function of {X*^~^, E*'), 
which is now a deterministic function {w,E'^). 

In what follows, we investigate three cases: energy arrival 
process observed causally at the transmitter only; observed 
causally at the transmitter as well as the receiver; and observed 
noncausally at both the transmitter and the receiver. We obtain 
capacities for each of these cases, denoted by ^TxRx'’ 

and respectively. 

A. Equivalent Channel Model with Causal TX Side Informa¬ 
tion 

Consider the channel dehned in the previous section with 
the energy arrivals observed causally at the transmitter. Follow¬ 
ing Shannon’s approach | |28) as done in Q, this channel can be 
converted into an equivalent channel with no state information 
at the transmitter but with a different input alphabet, using 
Shannon strategies: the input to the equivalent channel at time 
f is a function ut : S*" ^ X and the input alphabet for 
blocklength n is of the form 

= {vJ^\ut:£^ ^ X, t = (9) 

Note that is not a Cartesian product of n copies of a 
single alphabet, but a set of n-tuples where each element is 
dehned above. At time t, given the realization of E^, Xt = 
Ut{E*) is transmitted over the original channel. The output 


of the channel is the corresponding Yt G y. This implies the 
following transition probabilities for this new channel: 

PY.^u4ylu-) = Y,PEAenPY^\xAylu-{en) 

gTJ. 

n 

=En PE{et)PY\x{yt\ut{e*))- (10) 

e'^ t—1 

Note that there is no transmitter side information for this chan¬ 
nel and the encoding functions (0 become : A4 W^. 
However, not all n-tuples in are admissible. The energy 
constraints on our original energy harvesting channel imply 
that the admissible channel inputs rt" should satisfy for every 
e" G 

(ut(e‘))^ < 6t, (11) 

bt = min{6t_i - {ut-i{e*~'^) f + et,B}. (12) 

It is easy to see that the capacity of this channel is equal to 
that of our original channel, as coding strategies for one can 
be immediately translated to the other. 

When the energy arrival process is observed only at the 
transmitter, we consider the equivalent channel described 
above instead of the original model. When the receiver also 
observes the energy arrival process, we consider the original 
channel model. 

III. Main Results 

The main result of this paper is the approximation of 
the capacity of the energy harvesting channel under various 
assumptions on the availability of energy arrival information at 
the transmitter and the receiver given in the following theorem. 

Theorem 1. The capacity of the energy harvesting channel in 
bits/channel use is bounded by 

i log(l + p)- 3.85 < < I log(l + y), (13) 

i log(l + /i) - 2.85 < ^ log(l + m), 

(14) 

where p. = E[min{i7t, B}]. 

The proof of this theorem consists of three main steps, each 
of which can be of interest in its own right. The first step is to 
characterize the capacity of the energy harvesting channel as 
an n-letter mutual information rate under various assumptions 
on the availability of energy arrival information (Theorem]^. 
The second step is to use this characterization to show that 
the information-theoretic capacity of the energy harvesting 
channel is within a constant gap of 1.05 or 2.05 (respectively 
for ( foi l and ( [T^ ) from its power control formulation studied 
in the communication theory literature (Theorem Q. The third 
step is to provide an approximate solution to the online 
power control problem with bounded guarantee of 1.8 on the 
approximation gap (Theorem]^. 
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To state the expressions for capacity, we define the set of 
allowed input distributions on for the equivalent channel; 

Vnib) = s.t. a.s. for f = 1,..., n and Ve" G £" : 

(C/*(e*))2 < Bt, Bo = b, 

Bt = min{Bt_i - (C/t_i(e*"^))^ + e*, S}|. 

(15) 

Note that we impose the energy constraints by assigning zero 
probability to any codeword that does not satisfy 0 and ( |T^ . 
Similarly, dehne 

Pn{b) = ^Px^\E^ s.t. Ve" G f", a.s. for t = 1,..., n : 

Xf < Bt, Bo = b, 

Bt=inin{Bt-i-Xt_^ + et,B}y (16) 

For the case of causal energy arrival information at both 

the transmitter and the receiver, we use the notion of causal 

conditioning as in p9) , namely let 

n 

e‘). (17) 

This differs from Px-lS" = I\t=i 

in that at time t the dependence on E'^ is replaced by only 
the past and present E*. Dehne 

Qn{b) = |Tx"||£;" : Px^\E^ = Px^\\e^ s.t. 

Ve" G a.s. for f = 1,..., n : 

Xt < Bt, Bo = b, 

Bt = min{Bt_i - X^_-^ + 

(18) 

This is similar to Pn{b), but imposes the additional constraint 
that Xt must depend on Et in a causal manner, as dehned 
in Note that Bt is a function of {X* ^,E^), so Q„(6) 
is well-dehned. 

Using these dehnitions, we state the following theorem: 

Theorem 2. The capacities of the energy harvesting channel 
with various levels of energy arrival information are given by 

(^ca^usai ^ lim 1 sup Kjj^-Y^), (19) 

C'SRx=lim^ sup J(X";y"|F;"), (20) 

= lim ^ sup I{X^-,Y^\E^), (21) 

where the supremum in should be interpreted as setting 

the input distribution Px^\E'^ix'^\e^) = (3^"||e"), or 

in other words, the Markov chain Xt — {X*~^, E*~^) — E^j^i 
holds for every t = 1,... ,n. 

Although we focus on the AWGN channel in this paper, it 
is straightforward to see that Theorem generalizes to any 
memoryless channel. The proof of the theorem is given in 
Section |lV] 


The expressions in Theorem depend on the initial state 
of the battery Bq = b. However, in the following, we show 
that the capacity does not depend on b, which implies that 
the expressions ([T9l)-(|2T|) can be evaluated for any value of 
b G [0, B] regardless of the actual value of Bo. In fact. Bo 
can even be a random variable or an arbitrary value in [0, i?], 
unknown to the transmitter and the receiver. By “waiting” a 
period of time before starting transmission, during which the 
transmitter remains silent, and which is long enough to charge 
the battery from 0 to B, we can essentially transmit any coding 
scheme designed for any value of Bo. 


Proposition 1. The capacity of the energy harvesting channel 
does not depend on the initial battery state Bo. 


See Appendix A-A for the proof. 

We next turn our attention to the power control problem for 
an energy harvesting communication system that has been of 
signihcant interest in the recent communication theory liter¬ 
ature |[T2)-@. We introduce some new terms and notations 
to dehne the problem. A power control policy for an energy 
harvesting system is a sequence of mappings from energy 
arrivals to a non-negative number, which will denote a level 
of instantaneous power. More precisely, an online policy g" is 
a sequence of mappings 


^ M+ ,t = l,...,n, (22) 


and an offline policy p" is a sequence of mappings 

gt : ^ ffi+ ,t = 1,... ,n. (23) 


An admissible policy is such that satishes the energy con¬ 
straints. Formally, the set of all admissible policies with initial 
battery level b is: 


Gnib) = {5"| s.t. Ve" G £" : 

gt < bt, bo = b, 

bt = mm{bt-i - gt-i + et,B}}. (24) 

We denote by the set of all admissible online 

policies, and by (6) the set of all admissible offline 

policies. 

For a given online policy of length n, we dehne the average 
throughput to be; 


= -E 

n 


^ log{l + gt{E*)) 




(25) 


where the expectation is over the energy arrivals Ei,..., En, 
and similarly for an offline policy, the average throughput is: 


^(5”) = -E 
n 


+gt{E-)) 


t = l 


(26) 


Next, we dehne the following optimization problems which 
aim to maximize the long-term average throughput 


^online 

= lim inf 

max 


(27) 


n—foo 




^offline 

= lim inf 

max 


(28) 


n—fOC) 
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Equations and ( |28] l describe the online and offline 

power control optimization problems, respectively, studied 
extensively in the literaturen In both problems, we want 
to maximize the long-term average throughput subject to 
energy constraints as given in ( |24l i, assuming there exists a 
transmission scheme for which allocating power pt at time t 
yields an information rate r{pt) = | log(l +pt)- While the 
optimal offline power control policy has been explicitly char¬ 
acterized in 1121 - 03, there is limited understanding regarding 
the structure of the optimal online power control policy and 
the resultant long-term average throughput. In the following 
theorem, we characterize the long-term average throughput in 
the online case within a constant gap independent of system 
parameters. 

Theorem 3. The solutions to the online and offline power 
control problems are bounded by 


i log(l + p,)- 1.80 < < T 


-offline^ liog(i + ^)^ 


where p, = E[min{£'(, i?}]. 

While the proof of the upper bound follows from a simple 
application of Jensen’s inequality, to establish the lower bound 
on the throughput we construct an explicit online power con¬ 
trol policy 5 " and show that the long-term average throughput 
it achieves can be at most 1.8 bits/channel use away from the 
upper bound, i.e. 

liminf > I log(l + p)- 1.80. (29) 

n—>-oo 2 

This power control policy has a surprising structure: it waits 
for the battery to be recharged completely and then allocates 
power in an exponentially decaying manner. The proof of the 
theorem and the corresponding approxim ately optimal online 
power control policy are given in Section Vp 


The next step is to connect the two problems discussed 
so far. In particular, we show that the solution of the power 
control optimization problem can be used to lower and up¬ 
per bound the information-theoretic capacity of the channel 
in Theorem which involves a much harder optimization 
problem. 


Theorem 4. The capacities of the energy harvesting channel 
with various levels of energy arrival information can be 
bounded by 


> liminf max , - lH{g-{E-))} 


n—>-00 gn^Qo 


= (b) 


log ( 


7re\ 

tJ 


^causal ^ ^online 


'lonline ^ ^ /^causal ^ /Tnonline 

" 2 V YJ - ^TxRx 

^ /'7rp\ tnoncausal ^ /TnofHine 

TxRx ^ ^ 


T ^ 

^log(y) 


^offline 

2 


(30a) 

(30b) 

(31) 

(32) 


N ote th at \ « 1.05. The proof is given in Sec- 

Note that g'^{E'^) in (|30a[) being a deterministic (and 


VI 


tion 

causal) function of E'^ can be regarded as a random process 
itself and the term ^H{g'^{E'^)) corresponds to the entropy 
per symbol of the first n symbols of this process. Note that 
we can further lower bound OOab to obtain 


T 


online 


-H{E^)-\\og[^]<C: 


causal ^ rponline 
Tx ^ 5 


since g" is a deterministic function of i?" and therefore 
i?( 5 "(!?"■)) < E{{E^). However, the original form of the 
lower bound in ( |30a| l can be significantly tighter than the form 
above since the entropy of the allocated power process 
can be significantly smaller than the entropy of the energy 
harvesting process i?". In particular, we show in Section |V] 
that the online power control policy we develop to achieve the 
lower bound in Theorem has entropy per symbol bounded 
by 1 bit/channel use. This is formally stated in the following 
proposition. The form of the lower bound in ( |30a| l (and 
more specifically its proof) reveal a trade-off in designing 
communication strategies for channels with state information 
available only at the transmitter: while the transmitter knowing 
the state of the channel can follow a different strategy for each 
value of the state, the need to infer the state from the received 
signal can lead to a rate hit proportional to the entropy of the 
state. This can make strategies that have a coarser dependence 
on the state more desirable. 


the literature jl2) - ||l4) , the offline power control problem is typically 
Studied for an arbitrary known sequence of energy arrivals, without imposing 
a distribution on the energy amvals. Note that even with this difference, the 
resultant optimization problems and the corresponding optimal offline policies 
are equivalent, since the offline policy in our current case needs to maximize 
the throughput achieved under any given realization of the process. Imposing 
a distribution on the energy arrivals allows us to have a notion of long-term 
average offline throughput. 

^ In a recent publication |30| , the Fixed Fraction policy is suggested, in 
which gt = pBt, i.e. a fixed fraction of the battery level is allocated at each 
time slot. This policy is shown to achieve the optimal throughput up to a 
gap of only 0.72, by showing that Bernoulli energy arrivals yield the worst 
performance for this policy. This result combined with Theorem which 
is stated next, can be immediately used to decrease the approximation gap 
for and in Theorem]^ to 1.77 from 2.85. However, it 

does not lead to a similar ii^rovement for the approximation gap for 
since, as given by Theorem 4 the gap for depends on the entropy 

per sym bol of the online pmicy. Since the entropy per symbol of the policy 
in |3Q| is equal to H{Et), which can be arbitrarily large, here we devise an 
alternative online policy for which we can simultaneously have the guarantees 
provided in Propositions [^andj^ 


Proposition 2. For the online power control policy g” achiev¬ 
ing ( [29l l, we have 

-7T(Y(£^")) < 1 . 

n 

In particular, this property of 5 " together with \29) yields 
liminf max {^{g^ - ^Hig^iE^))} 

n—yoo 

> ^log(l+/x)- 2.80. (33) 

^ Theorems 00 are stated for Ft that follows a discrete distribution. 
However, as mentioned in Section[T^ these results can be extended to arbitrary 
probability distributions. The exact derivation exceeds the scope of this paper, 
however, we mention briefly that this can be done by considering quantized 
versions of Et and applying Theorems and 1^ In the limit of very high 
resolution, it can be shown that the throughputs j^^ime j'ofTiine converge 
to the throughputs of the original process Et. This, along with the fact that 
Theoremholds for any distribution of Et, yields Theorem^ 






It is immediate to verify this proposition, which we do in 
Section]^ after we introduce the policy p”. 

Note that when the receiver has knowledge of the energy ar¬ 
rival process in ( |3T| i and ( [32l i, the gap between the information- 
theoretic capacity and the long-term average throughput is only 
^ log (^) which is approximately 1.05. It is indeed surprising 
that the actual information-theoretic capacity achieves, within 
a constant gap, the solution of the power control problem. 


In Section VI we suggest one natural way to use the optimal 


power control policy to construct explicit codes which achieve 
the lower bounds in (|30ll-(|3^. This emphasizes the importance 
of the power control problem in understanding the more 
fundamental information-theoretic problem. 

The additive approximations in Theorem obviously be¬ 
come irrelevant at low-SNR since the lower bounds @- 
( |3^ can become negative, and thus useless. This indeed is 
an artifact of our constant gap approximation approach which 
bounds the worst case additive gap between the quantities of 
interest (the worst case gap for all the approximation theorems 
above occurs typically when p and B are very large). It should 
be clear from the proofs of these approximation theorems that 
as the quantities of interest become small, the additive gap 
between them also tends to zero. In order to illustrate this fact, 
in the following theorem we provide a multiplicative relation 
between the information-theoretic capacity and the long-term 
average throughput which is more relevant in the low-SNR 
regime. 

Theorem 5. The capacities of the energy harvesting channel 
with various levels of energy arrival information can be 
bounded by 

C— > liminf max {p • 

ra->-oo 

(34a) 
(34b) 


^causal ^ rjii 


online 


< T 


online 


nonline ^ ^causal 
offline ^ ^noncausal ^ ^offline 


(35) 

(36) 


T ( 

^ "-^TxRx 

T^offline 

g 1 ^ '-^TxRx 

where the parameter g > 0.7473. 

See Appendix for the proof. Again, note that ( |34a| ) can 
be further lower bounded to obtain 


g ■ -H{Et)< < T 


nonline 


Finally, note that combining ( [33] l with the inequalities in 
Theorem we immediately obtain Theorem 


IV. Channel Capacity; Proof of Theorem[2] 

We begin with proof of achievability for the hrst case - 
causal energy arrival information at the transmitter alone, 
and we consider the equivalent channel model developed in 
the beginning of Section II-A We construct an achievable 


scheme composed of k blocks with each block containing a 
codeword of length n which is an element of As such, 
each codeword is a function of only the past n energy arrivals, 
which means we ignore information regarding all the energy 
arrivals in the previous blocks. These codewords are designed 


to satisfy the energy constraints for initial battery level Bq = b, 
so to accommodate this, we must ensure that the battery level 
in the beginning of each block is at least b. To this matter, 
we allow the battery to “recharge” after we transmit each 
codeword by waiting a sufficient amount of time (i time 
slots), during which the transmitter remains silent. If i is large 
enough, the probability of recharging the battery back to level 
b will be high. In the case when the battery is not sufficiently 
recharged at the beginning of the next block, we can simply 
give up on this block and transmit the all-zero codeword. We 
will explicitly show that this will have a negligible effect on 
the achievable rate. 

To make the probabilistic analysis simpler, it is helpful 
to have the different blocks statistically independent of each 
other. Note that subsequent blocks are coupled through the 
battery state. More precisely, the event of whether the battery 
at the beginning of each block is recharged to b or not (which, 
in turn, determines whether a codeword is being sent or just 
zeros) may depend on the amount of energy left in the battery 
at the end of the transmission in the previous block. To 
decouple one block from its sequel, we purposely deplete the 
battery to zero before waiting for it to be recharged. This way, 
the battery level at the beginning of the next block will depend 
solely on the last i energy arrivals. In what follows, we make 
the above description precise. 

Fix P[/n £ Vn{b) and for each message w, generate k 
random codewords independently ~ P;/", * = l,...,fc. 
Recall that each is a function on £^. The chosen message 
w will be transmitted over k blocks, each of size n + i + 1, 
for a total transmit time of k{n + £+l). Hence, we will dehne 
codewords yfc("+^+i) g i/Hn+e+i) ^gjJJg (jjg above v^. 

Each block comprises three parts; the hrst part, of length 
n, consists of the codeword (or an all-zero vector of length 
n if the battery level at the beginning of this block is not 
sufficient to transmit codeword v^). In the second part, which 
takes only one time slot, we deplete the battery to 0. The 
third part consists of £ zeros, which are meant to recharge the 
battery to level b. 

Consider block i, 1 < i < k, which takes place during 
times t = {i — l)(n to t = i{n + £ + \). 

^Strictly speaking, there are no energy arrivals and battery state in the 
equivalent model defined in (H}, but e* and bt in GD can be rather viewed 
as dummy variables where represents the input variables of the function 
and bt is an intermediate variable used to define the input constraint. However, 
we continue to refer to as the energy arrival sequence up to time t and bt 
as the battery state at time t to make the exposition easier. 

^The idea of “erasing” the memory in the battery by using codewords 
interleaved with silent times has first appeared in which considers a 
special case of the problem with constant deterministic energy arrivals. A 
block i.i.d. coding scheme was proposed in when the transmitter has 
causal energy arrival information but 6 = 0, m which case one does not 
need the zero-padding between the codewords to recharge the battery. Our 
achievable strategy is closer to which considers noncausal energy aiTival 
information at the transmitter in me case of Bq = B. However, the proof in 
jT] is incomplete because it assumes that by making the zero padding between 
blocks long enough, we can ensure that the battery is recharged to full each 
and every time. This is not possible, because as the number of blocks k oo, 
recharging failures are inevitable and have to be explicitly taken into account. 
Also, in the noncausal case, the codewords can be constructed directly on 
the original channel with input alphabet X and not as we do here (See 
Appendix]^. In |25| , we show that the noncausal capacity is strictly larger 
and therefore different than the causal capacity. 
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T Bi 

bo,i 


T 

bd,i 


Fig. 3. Stmcture of block i in the coding scheme. 


We define the following notations: &o,i — ^(i-i)(n+£+i) is 
the battery level before the beginning of the block, i.e. the 
initial battery level before we start transmitting codeword 
Vi; Zi = l{;,oi>b} is an indicator, denoting whether the 
initial battery level &o,i is sufficient to transmit the codeword 
V*; bd,i = 5(i_i)(„+i>+i)+„+i is the battery level at time 
{i — l){n + £ + 1) + n + 1, which will be used to deplete 
the battery at this time-slot; Oi = are the 

energy arrivals during the transmission of the codeword vf, 
and e* = the energy arrivals during 

the zero-padding phase at the end of each block. Denote by 0 
a vector of £ zeros. 

Block i of the codeword is constructed as follows: at the 
beginning of block i at time t = (i — l)(n -I- £ -I- 1 ) -I- 1 , 
given for each we can compute 6 o,i = ^t-i- If 

the battery state is at least b, we send the codeword v^, i.e. 
^t+n-i^^t+n-i'^ _ Yj(ei). Otherwise, the transmitter sends 
zeros for n time slots. Thus, the first part of block i can be 
written as Zi • v^. For the second part of the block, which is 
the single time-slot t + n, again given for each 

the transmitter computes the battery state bd^ = bt+n, and 
transmits a/ 6 ^, i.e. ut+„(e‘+") = ^bd,i{e*+'^). This will 
deplete the battery to zero. By purposely depleting the battery 
before recharging it, we remove the dependence between 
different blocks, and make the probability of recharge failure 
an i.i.d. process. Next, the transmitter sends zeros for £ time 
slots, which will recharge the battery to b with high probability. 

To summarize, the transmitted block is 


i(n+l+l) 


(e*("+^+i)) = [z. 


v^(ei), o], (37) 


See Fig. for a graphical representation of the block struc¬ 
ture. Note that since the battery is depleted to zero at the 
second part of block i, Zi+i is a deterministic function of 
e^. In fact, is a deterministic function of 6o,i+i, where 
bo,^+l = ^ir,{B,Y.\=^-X]ln+i+l)+r,+2^t}- 

^k(n+i+i)^^k(n+i+i)-^ defined in ( |J7l i is a well-defined 
element in Moreover, observe that it satisfies the 

energy constraint: this is trivial for For the subsequent 

codewords, if the battery level is larger than b, we assume it is 
b and ignore (waste) the remaining energy. If it is less than b, 
we transmit only zeros. This will satisfy the energy constraints. 

Denote the channel output during the first part of block i 
by Yi = 2/(ti)(«+r+i)+i ■ The receiver observes yHn+e+i) 
but makes use only of = (yi,...,yfc) for decoding, 
by applying standard jointly typical decoding with v^. The 


channel transition probability from to is 

PY''iv''(y^|v*) 

e^tn+c+i) 

k 

= E n -Pe" (ei)PE{e(i_i)(n+e+l)+n+l)P e^ (®i) 

gfc(„+f+l) i^l 

■ Pv"|Js:"(y»k*(ei-i) • V,(e,)) 
k 

= n X! PE"(e*)PEf(ei)PY"|X"(y*ki(ei-i) • v,(ej)) 

i—1 Gi ,ei 

k 

=nE PE^[^i)Pz{Zi)PYr^\X^{yi\Zi ■ Vj(ei)). 

i—1 Gi.Zi 

Note that since Et is i.i.d., Pe^ and Pz do not depend on 
so this is a memoryless channel with transition probability 

PY|v(y|v) = ^PE"(e”)Pz(z)PF"|v-(yk'v(e”)) 

,2 

= '^Pz{z)PY^\u^{y\z ■ yr). 

z 

where the last step is from ( [TOl i. Note that Y = Y” is the 
output of the channel from (7" to Y" with the input multiplied 
by an independent Bernoulli RV Z. 

Denote Pz(0) = a. Taking fc —)• oo, we get by standard 
joint typicality arguments that rate /(V; Y) is achievable. The 
following holds: 

/(V;Y)=/(V;Y,Z)-/(V;Z|Y) 

>/(V;Y|Z)-iT(Y) 

= (l-a)/(V;Y|Z=l)-i72(a), 

where iT 2 ( ) is the binary entropy function. The last step is 
because /(V; Y|Z = 0) = 0. 

Note that PY|v.z(y|v,z = 1) = PviE"(y|v) and, by 
construction, V ~ Pun independent of Z. This implies 

/(V;Y|Y = 1) = I{U’^;Y'^). 


For £ large enough, a can be upper bounded using the law 
of large numbers a = Pr{X]t=i Et < b} < eg, where 
limf_>oo ££ = 0 (recall that E[£'t] > 0), s.t. for every n > 1 
we have 


/^causal 'V. 
^Tx ^ 


n + £-\-\ 


H2{ei) 


Since Pu^ is an arbitrary input distribution in Vn{b), we can 
take the supremum to obtain 


C causal 
Tx 


> sup 

Pur^eVnib) 


(l-£^)J((7";Y")-g2(£r) 

n + £+1 


Let £ = [logn]. Taking n —t oo, we get 

(^ca^usai > limsup 1 g^p (38) 

n—j-oo 


^ Z\ is an exception, since it equals 1 w.p. 1. Nevertheless, one can 
artificially generate a Bernoulli RV and choose whether to transmit vi or 
zeros according to the outcome. 
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For the converse part, we use Fano’s inequality as in pT). 


An (M, n, e) code for the channel dehned in Section 11-A 
satishes 


H{W\Y'^) < H 2 {e) + e\ogM 
(1 - e) logM < I{W- y”) + H 2 {e) 

If R < is achievable, then for every <5 > 0, 

R-S <--^[IiU^-Y^)+H2ie)], 
nl — £ 

where F") is the mutual information evaluated for Pu^ 

induced by the code. Since all codewords must satisfy the input 
constraints and E^, this implies Pf/r. G P„(&) (see (fTS])). 
Therefore 


nl — £ 

which implies 

liminf- sup I{U^-Y^). 

l — £ "-i-oo n 

Taking e —>■ 0 gives 

(^causal ^ limij^fl sup UU^'Y^). (39) 

Together with ( [38l l, this implies that the limit exists and is 
given by EB- 

We now turn to the case of energy arrival information 
available causally at the transmitter and the receiver (eq. @). 
We can repeat the previous steps in exactly the same manner, 
and since the receiver now observes P” as well, we simply 
add it alongside F". All the arguments still hold, and we are 
left with 

hm - sup /(P";F”,P") 

n->oo n 

i lim - sup /(P’^;F”|P") 

n->oo n pj^„gp^(f,) 

= lim - sup /(X’";F”|P") 

n->-oo n 

lim - sup /(A:";F"|P"). 

where (i) is because P” is independent of P”; (ii) is because 
a:" = P"(P”) and the Markov chain P” - (AT”, P”) - F”; 
and (iii) is because, as will be shown below in ( |40| i, any distri¬ 
bution Pp». G Vnib) induces a distribution Px^\\e^ G Qn{b) 
on F", and any X" ^ Px^yp" G Qn{b) can be represented 
as a random function of P” according to some distribution 
Pu^ G Pn{b). This gives ( |20| ). 

To show (iii), observe that the joint distribution Px”^,e^,u^ 
can be factored as 

n 

= Y]_PUi\U'-^{Ui\u"~^)PE{ei)l{Xi = Ui{P)}. 


sup /(P";F’^) + P 2 (£) , 
Purser ^(b) 


Summing over Xn and then over u„ gives 

n —1 

= n Pui\ui-^{ui\u''~^)PE{ei)l{xi = Ui{P)} ■ PE(e„). 
i=l 

Continuing to sum over x„_i, u„_i, and then a;„_ 2 , u „_2 and 
so forth yields for any t: 

Px\E^,u* 

t 71 

= WPui\u-^iuiW'‘~^)l{xi = Ui{P)} • J|P£;(ei). 
Summing over e^j^i gives 

t 

= 'WPu,\u--^{uiW~^)PE{.ei)l{xi = Ui{P)}, 

hence 

n 

Px‘.P",c/*(a;*,e",u*) = Pxt,p*,(7t(a;*,e*,u*) • Pp(ei). 

Summing over u* gives 

71 

Px\E^{x*,e'^) = Pxt,E*{x*,e*) Ppjei). 

This implies 

-Px*-i.p‘(a:*"\e*) = Pxt-i.pt-i (x‘"\e‘"^)PE(et). 
Using all of the above identities, we have; 

Pxt\xt-\E^{xt\x*~^,e'^) 

^ PyGp.(cr*,e") 

^ Px*,E^{x*,e*)U7=t+iPE{ei) 

Pxt-\Et-^ (p-i, e*-i) nr=t Psisi) 

^ PyGP*(a:*,e*)nr=.t+i-Pg(eO 

Px*-\E* {x*-\e^) nr=t+i Psiei) 

= Pxt \ x <^-'^, E * ixt \ x * ^,e*). (40) 

This implies Px"|pr.(x"|e”) = Px^yp™(a;"-||e"). 

The proof of the remaining case (namely energy arrival 
information available at the transmitter and the receiver non- 
causally) follows exactly the same lines and appears in Ap¬ 
pendix 1^ 

V. Optimal Online Power Control; Proof of 
Theorem[3] 

In this section, we consider the power control problem for 
the energy harvesting communication system. This problem 
was formally dehned in (|2^-(|28|) in the two settings of 
interest, offline and online. Here we focus on the online version 
of the problem. To recall, the goal is to hnd an optimal online 
power control policy 

^ K+ = 1,... ,n, 











II 


that satisfies the energy constraints, i.e. it belongs to the set 

^online (^) ^ | | ^ ^ . 

gt < bt, bo = b, 

bt = niin{&(_i - gt-i + et, B}] ■ 
and maximizes the long-term average throughput of the system 
T°niine = liminf max (41) 

n-foo g"Gp°"'‘"'‘(b) 

We start by deriving a simple upper bound on the throughput 
which serves as a benchmark for the online polices we 
construct in the rest of the section. 


A. Upper Bounding the Throughput 

The upper bound we develop in this section not only holds 
for which is of main interest in this section, but 

also for defined in ( |28l l. First note that without loss 

of generality, we can replace the random process Et with 
Et = min{£'t,i3} without changing the system. This is due 
to the fact that whenever an energy arrival Et is larger than 

B, it will be clipped to at most B. Denote p = For 

any n and any policy p", we have: 


^( 5 ”) = 


(i) 

< 


(ii) 

< 


1 "■ 
-Ve 

n 


log {I+ gt{E^)) 


llog(l+lE[^g,(E")]j 
llog(l + iE[B + |:4]j 
ilc,g(l + iiS + g) 


where (i) is by concavity of log; (ii) follows from the fact that 
the total allocated energy up to time n, can not exceed the 
total energy that arrives up to time n plus the energy initially 
available in the battery. 


'^gt<B + ^Ef 

The last term tends to 4 log(l + p) as n —)■ c». Note that this 
is true for both offline and online power control policies, and 
for any energy arrival process Et- We therefore have: 

^online < ^.offline < 1 (42) 

where p = E[min{i?t, i?}], which proves the upper bound in 
Theorem [3 


B. Approximately Optimal Online Power Control Policies 

We next turn to developing explicit online power control 
policies that can provably approach the upper bound developed 
in the previous section. While we are interested in policies that 
perform well for any arbitrary i.i.d. energy harvesting process, 
our development is inspired by the approximately optimal 
online power control policy for i.i.d. Bernoulli energy arrivals 


developed in Q. We next overview this policy and prove its 
approximate optimality in a somewhat simpler manner which 
also leads to a slightly better gap (0.721 bits/channel use as 
opposed to 0.973 bits/channel use in Q). Our analysis for the 
general case leverages on this derivation. 

1) Bernoulli Energy Arrivals: Assume the energy arrivals 
Et are i.i.d. Bernoulli RVs: 


Et = 



w.p. p 
w.p. 1 - p, 


i.e. at each time t either the battery is fully charged to B with 
probability p or no energy is harvested at all with probability 
1 -p. 0 proposes the following online power control policy 
for this system: Let jt{E*) be the time of the last energy 
arrival, i.e. 


jt{E*) = {sup T <t-. Er = B}. 


The policy is defined as follows: 

gtiE*) = Bp{l-pY~^*. 

With this policy, the amount of energy we allocate to each 
time slot decreases exponentially with the time since the last 
battery recharge (or equivalently energy arrival). Note that this 
is clearly an admissible strategy since 

OO 

k=jt 

i.e. the total energy we allocate until the next battery recharge 
can never exceed B, the amount of energy initially available in 
the battery. Another way to view this strategy is that we always 
use p fraction of the remaining energy in the battery at each 
time. Note that the energy in the battery decays like Bt = 
(1 — pY~^*B. The motivation for this power control policy 
can be understood as follows: for the Bernoulli arrival process 
Et, the inter-arrival time is a Geometric random variable with 
parameter p. We know that the Geometric random variable is 
memoryless and has mean 1/p. Therefore, at each time step, 
the expected number of time steps to the next energy arrival 
is always 1/p. Since log(-) is a concave function, we would 
ideally want to allocate the energy as uniformly as possible 
over time, i.e. if the current energy level in the battery is bt 
and we knew that the next recharge of the battery would be 
in exactly m channel uses, we would allocate bt/m energy 
to each of the next m channel uses. For the online case of 
interest here, we do not know when the next energy arrival 
will be. Instead, we use the expected time to the next energy 
arrival: since at each time step, the expected time to the next 
energy arrival is 1 /p, we allocate a fraction p of the currently 
available energy in the battery. Fig. illustrates this power 
control policy. 

Before moving forward to establish the approximate op¬ 
timality of this power control policy, we provide a few 
definitions and results from renewal theory. 

Definition 1. A stochastic process {Xt}'^^ is called a non- 
delayed regenerative process if there exists a random time r > 
0 such that the process {Xr+t}tYi has the same distribution 
as {Xt}^i and is independent of the past (t,X'^). 
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Fig. 4. The approximately optimal online power control policy for Bernoulli 
energy arrivals. 


Observe that a regenerative process is composed of i.i.d. 
“cycles” or epochs, which have i.i.d. durations ti, r 2 ,.... At 
the beginning of each epoch, the process “regenerates” and 
all memory of the past is essentially erased. The following 
theorem establishes an important time-average property of 
regenerative processes. 


Theorem 6 (LLN for Regenerative Processes). Let 
Xt € X, be a non-delayed regenerative process with associ¬ 
ated epoch duration t, and let f : X ^ M.. If Er < oo and 
EELi \fi^t)\] < oo then: 


1 1 

lim -T,fiX,) = -E 




E/w) 


,4=1 


This is an immediate consequence of Theorem 3.1 in [31] 
Ch. VI] or of the renewal reward theorem |32 Prop. 7.3]. 

Equipped with this theorem, we now consider the process 
gt{E*) obtained by our power control policy for Bernoulli 
energy arrivals. We show in Appendix |A-B| that the initial 
battery level is irrelevant to the long-term average throughput 
(similarly to Proposition this follows from the fact that 
we can always wait until the battery recharges to B before 
starting transmission, with a vanishing penalty to the average 
throughput). We can therefore assume without loss of gen¬ 
erality that Bq = B, which necessarily implies Bi = B. 
Hence we can equivalently assume Ei = B. Denote by L 
the random time between two consecutive energy arrivals. 
Evidently, L ^ Geometric(p). That is. 


Pr(L = k) = p{l — p) 


k-l 


,k=l,2,... 


Observe that gt{E*) is a non-delayed regenerative process 
with epoch duration L. We apply Theorem with f{x) = 
\ log(l-|-a;). Note that EL = 1/p < oo and E[^^^ \ ^ log(l-|- 
gt{E*)\] < E[L • i log(l -f B)] < oo, so the conditions of the 
theorem are satisfied. We obtain 


1 ” 1 

lim -log(l-f 5 t(L;*)) 


n—^oo n ^' 2 

4=1 


= —E 
EL 


^-log(l-f 54(i^*)) 


a.s. 


(43) 


We proceed to lower bound the average throughput obtained 
by our suggested power control policy, which is itself a lower 
bound to 


T 


online 


1 "" 

> lim inf — E 
n—>-oo n 


-log{l + gt{B)) 


(i) 

> E 


= E 


11 

lim inf - V - log(l + gt(L‘)) 

n—>oo n 2, 

4=1 


1 

EL^ 


rE 


1 


^ -log(l-f 54(£^‘)) 


.4=1 


= —E 
EL 


1 ’—E 
EL 


(iv) 1 

^EL® 
^ EL®^ 


^„log(l + 5,(L;‘)) 


t-1 

■ L 


'^:.^og{l + Bp{l-p) 


i-l\ 


.i=l 
■ L 

E 

. 2=1 

l 1 log(l + pB) + 1 iog(l _ p) 


^ log(l -f pL) -f (i - 1) i log(l - p) 


= ^^og(l-hpB) - ^ 


E[L 2 ] 

EL 


- 1 log 


1 

1 -p 


(V) 1 


= 2^og(l+pB) - - 


1 / 2 — 3p -f p^ 


(1 -p)p 


- 1 log 


= \ log(l + pB) - log 


1 -p 


(44) 
1 

1 -p 

(45) 


where (i) is by Fatou’s lemma |33 Thm. 1.5.4]; (ii) is due 
to (EH; (iii) is by definition of the power control policy; (iv) 
is due to the inequality log(l -f ax) > log(l -f a;) -t- log a for 
0 < a < 1; and (v) is because L ~ Geometric(p). 

The second term in the above expression achieves its max¬ 
imum when p —> 0, in which case it is given by ~ 0.72. 
We conclude that for Bernoulli energy arrivals: 

T-“>^log(l + p)- 2 i^, (46) 


where p = E[min{Lt, B}] = pB is the average energy arrival 
rate of the Bernoulli process. 

2) General i.i.d Energy Harvesting Processes: We next 
turn to developing approximately optimal online power control 
policies for general i.i.d. energy harvesting processes. A simple 
extension of the Bernoulli power control policy presented in 
the previous section to general i.i.d. processes was proposed 
in 0 - 0 also showed, via providing examples, that this 
extension can achieve the long-term average throughput within 
a constant gap for some i.i.d. processes, but it would fail to do 
so for some others. 0 however did not explicitly characterize 
the distributions for which this extension achieves the long¬ 
term average throughput within a constant gap. Below, we 
first overview the policy proposed in calling it the binary 
quantization policy and then show that its gap to optimality 
depends on p/B. In particular, the gap becomes unbounded 
when p/B 0. We then construct a new online power 
control policy, which we call the generalized Bernoulli policy 
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which approximately achieves the long-term average through¬ 
put when ^/B is small. Considering these two policies in the 
large and small ^/B regimes respectively proves Theorem]^ 

The Binary Quantization Policy: Let Et be an arbitrary 
i.i.d. energy arrival process. Note that as before, given a battery 
size B we can concentrate on the equivalent process Et = 
mm{Et, B} with mean ^ Let the complementary 

cumulative distribution function (ccdf) F{x) = Pr{£^t > x}, 
X G [0, i?]. Consider the following policy: hx an energy 
level X. Denote by g' the probability of observing an energy 
arrival at least x, i.e. g' = E(x). Then, apply the exponen¬ 
tially decreasing strategy described above as if Et is i.i.d. 
Bernoulli with levels {0,a;} and probability g'. Effectively, 
when Et > x, we treat the incoming energy as a packet of 
size X and ignore the remaining energy. Alternatively, when 
Et < X, we ignore the incoming energy completely and 
assume no new energy has arrived. This of course is an 
admissible policy, but may be highly suboptimal as it ignores 
part of the incoming energy. Nevertheless, we will see below 
that, for a certain class of energy arrival processes, it differs 
from the upper bound only by a constant gap. 

To begin, we apply ( |46l ) to lower bound the rate obtained by 
this strategy: >^log (l + xF{x)) — i^^. The tightest 

lower bound will be obtained by maximizing over x G [0, B]: 

T-ii- > i log fl + mt,x {xF{x)}) - 

2 \ a;G[0,B] / 2 In 2 

We will now upper bound the gap of this policy by providing 
a lower bound of the form 

max xF{x) > c/i, 

[0,B] 

for c G [0,1]. To this end, we introduce the Lambert W 
function, which is dehned as the solution to z = W{z)e^^^\ 
This function is double-valued on (—1/e, 0) - that is, for 
z in this interval there are two possible solutions for the 
above transcendental equation. Specihcally, we are interested 
in the lower branch, denoted by W-i{z), which is dehned 
for z G [—1/e, 0). This function is strictly decreasing, and it 
decreases from W-i{—e~^) = —1 to lim^/iQ W_i(z) = —oo. 
We now show that 



is one such c. Note that since Et is of bounded support [0, B], 
we always have 0 < fji/B < 1, so the above expression is 
well-dehned. Also note that the dehnition of c* implies 

=-e ' , 

B e* 

1 - c* = cMn . (48) 

Now, suppose xF(x) < c*n, \/x G [0, B]. We have: 

fS _ 

P = j E{x)dx 

Jo 


nC* ti I'B 

< dx+ F{x)dx 

Jo J C* fl 

< c fj, + / ——dx 

Jc-tL X 

= c*n + c*n\n 

This yields c* -f c*ln^^^ > 1, which contradicts the 
dehnition of c* ( |48] l. 

Finally, we obtain: 

> ^log(l + Ai)-^log(|:) , (49) 

where the second step is due to the inequality log(l + ax) > 
log(l -I- x) -f logo for 0 < Of < 1, and c*{fj,/B) is dehned 
in 

Note that the gap of the policy to the upper bound 
i log(l + n) depends on the parameters of the problem n and 
B through c*. One observes that c* —0 as p/B —^ 0, making 
the gap unbounded. This suggests that the binary quantization 
policy does not work well for small p/B. For such distri¬ 
butions xF{x) can be much smaller than p, which implies 
that a signihcant amount of incoming energy is discarded by 
the policy. This indeed is the case for the counterexample 
presented in Q Section VI.C]. In the sequel, we will present 
a more interesting generalization of the Bernoulli policy, which 
achieves a hnite gap for the range of small /r/B. Choosing the 
appropriate policy out of the two, depending on the value of 
/i, will provide a bounded gap for all values of /i S [0,B]. 

The Generalized Bernoulli Policy: Let g = fi/B, where 
recall that fi = E[Bi] and Et = min{Bt,B}. Note that 
p € (0, B] so g G (0,1]. Consider the following energy 
allocation policy: 

gt = Bqil-g)*-^^=^i{l-g)*-^\ 

where 

St = St{E*,g*~^) = {sup T <t: Br = B}. 

That is, St is the last time the battery was completely full. This 
is clearly an admissible online power control policy, since, 
as before, even if the battery never gets recharged, the total 
energy used will not exceed B. 

Notice the similarity between the scheme in the previous 
section and the generalized Bernoulli policy: we transmit 
using an exponentially decreasing power allocation policy, 
and “restart” whenever the battery recharges completely. In 
the Bernoulli case, the event of battery recharge was an 
i.i.d. process depending solely on the energy arrivals. Here, 
it depends both on the energy arrivals Et (which have an 
arbitrary distribution) and on the sequence of powers gt- See 
Figure 

By construction, the sequence gt is a regenerative process; 
on the event of battery recharge, the power control policy 
“restarts”, and by the i.i.d. nature of the energy arrivals, the 
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Fig. 5. The generalized Bernoulli power control policy for general i.i.d. 
energy arrivals. 


portions of the process between consecutive battery recharges 
are independent and identically distributed. This lends to a 
similar analysis as in the Bernoulli case, and we can lower 
bound the average throughput following the steps in the 
previous section to obtain an analog of (|44|: 


yo„line> liog(i + ^) 


1 /E[L2] 
4 EL 




where L is the time between consecutive battery recharges. 
However, L is no longer geometrically distributed, but has 
some distribution that depends on Et and gt- Hence, to have 
applied Theorem in this case to obtain the above lower 
bound, we need to verify that L has hnite expectation; this 
is done in Appendix]^ 

We wish to upper bound the gap from ^ log(l + g): 


gap=i 



log 



To this matter, consider the first epoch (or cycle) of the 
regenerative process gt{E*). The length of the epoch L can 
be dehned as the hrst time for which Sl+i = B (i.e. 
the battery at the beginning of the following epoch is fully 
charged). This implies that Bt < B for f = 2,... ,L (recall 
Bi = B), therefore Bt = Ht-i — Qt-i + Et- Substituting 
gt = Bq{l — we obtain; 


t 

Bt=B{l-qY-^ 

2 = 2 


Denote St = X]i=2 Hence, L is the hrst time for which 
Bl- gL+ El +1 > B, or: 


L = {inf t: St>B[l-{l-qy]}. 


This is a stopping time adapted to Et+i, so by Wald’s hrst 
and second identities (cf. Theorems 4.1.5 and 4.1.6 in ||^): 


E[S'l] = el • /r, 

E[{Sl - gEf] = ct^EL, 

where = Var(ij(). We obtain; 

EIL^] = ^ (a^EL + 2 hE[LSl] - Els'!]). 


(50) 

(51) 


Next, by dehnition Sl-i < B[l — (1 — q)^ ^], and since 
Et < B, 

Sl = Sl-1 + El+i < B[1 - (1 - qf-y +B<2B. 


Moreover, 


E[Sl] W (E[5i])2 (,) 


(iii) 




EL - EL 

where (i) is true for any r.v.; (ii) is due to ( |50l l; and (iii) is 
because L > 1. Plugging these two inequalities in (|52|) yields; 


E[L2] 

EL 


, E[LS,] 


E[S'| 


< + 4:fiB - g,^). 


Next, observe that since 0 < Et < B, then Ef < EtB, and 
therefore E[£'(^] < gB, or < g{B — g) (This result, in a 
more general form, is called the Bhatia-Davis inequality |[34|). 
We conclude that 

E[L2] 


EL 


<5^-2=5-2, 

g q 


which yields 


^ 5 - 39 , 

gap < —-log 

49 


1 

l^q 


T 


liog(i + ^)-i^log 


1-9 


Observe that the gap is hnite for 9 —>^ 0. 
Finally, combining (|49]l and (|5^, we have; 


- log(l + g)- T™'*- 


< max min 
0<9<1 


c*(9)’ 


e 5 — 39 


49 


log 


(53) 


1-9 


< 1.8034, 

which completes the proof of Theorem 


C. Entropy of the Power Control Policies 

In the light of ( |30a| i in Theorem we care not only 
about the long-term average throughput achieved by a cer¬ 
tain online power control policy, but also its entropy per 
symbol ^H{g'^{E^)), which determines the gap between 
the information-theoretic capacity and the long-term average 
throughput in the case where the receiver does not have 
energy arrival information. We next show that the per-symbol 
entropies of the power control policies 9" we developed in 
the previous section can be bounded by 1 bit/channel use, 
and in this manner prove Proposition This is due to the 
structure of these processes and their regenerative nature. All 
the randomness in the processes ^"(L^”) is contained in the 
epoch start times; knowing where the epochs begin is sufficient 
to generate the corresponding for all the policies 

discussed in the previous section. 

We start with the policy for Bernoulli energy arrivals 
discussed in Section IV-B II Observe that 

-7L(9"(L;")) < -7L(L;") = HiEt) = H2 {p), 
n n 


(52) 
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which combined with < |45] > implies 

liminf max {^(5”) - ^H{g'^{E'^))} 

n—>-<50 gn^g online 

> ^\og{l+pB) - “ H2{p). 

The expression log(l — p) + H 2 {p) is upper bounded by 
1.5242 for all p € [0,1], hence we have 

liminf max - iiJ(p"(£;"))} 

n-foo " 

> ilog(l + p)- 1.5242, (54) 

where recall that p = pB for the Bernoulli process. 

Next, we bound the entropy of the binary quantization 
policy. Define E'^ = l{Et > x} and observe that for this 
policy 

^H{g-{E-)) = ^H{g-{E'-)) < ^H{E’-) = H2{F{x)). 

Then, similarly to ( |5^ , we have: 

> ^log(l + p) - ^ log ^ - 1.5242, (55) 

where q' = F{x) as defined in Section 

Finally, we focus on the generalized Bernoulli policy. Define 
the indicator process Ft = l{Bt = B). By construction, F* is 
enough to determine gt{E*), hence 

^H{g-{E-)) = iH{g-{F^)) < ^H{F^) < 1. 

We therefore have from ( |5^ : 

liminf max {g^) - {g'^{E^))} 

n->-oo 

>llog(l + rt-^log(^)-l. (56) 


V-B2 


A. Upper Bounds 

We start with Note that since C^^usai < we 

will obtain both ( |30b| l and the upper bound for ( |3T| l. Through 
some algebraic manipulations, we show in Appendix that 
Qn{b), defined in ( [TSl l, can be written as 

Qn{b) = |Px"||£;" s.t. Ve" G £", a.s. for f = 1,..., n: 

t t 

Y^X]<B+ Cj ,i = l,...,f, 
j=i i=i+l 

1=1 1=1 

Next, we define another set of probability distributions by 
relaxing the a.s. constraints to hold in expectation: 

Qn{^) = s.t. for f = 1,... ,n and Ve" G f": 

t t 

< B + Y 

,Z = 1, . . . , t, 

Y^[X^\E^=e^]<b + Yej}- 
1=1 1=1 

Observe that Q„(6) C Q* (6). 

By the same arguments as before, we can write ( |24| ) for the 
set of online policies as: 

= {g- = (pi,... ,5„), gt-.S*^ K+, 
s.t. Ve" G £" : 

t t 

Ydji^^) ^ B + Y = 

t t 

Ydji^^) + ,f = l,...,n|. 

1=1 1=1 

(57) 

Next, we upper bound the mutual information in ( |20l i as 
follows: 


Now, combining ( |55l l and ( |56l l: 
ilog(l + p)-liminf max - ii7(p"(i7"))} 

< max min | ^ log —^ + 1.5242, ^ log ( —-— ) + 1 

- o<g<i [2 c*{q) 4g ^ \1 - q J 

< 2.8034 

which, along with ( |4^ , gives the result of Proposition 

VI. Connection between Capacity and 
Throughput: Proof of Theorem|4] 

In this section we upper and lower bound the capacity of the 
energy harvesting channel, given by the expressions in The¬ 
orem 1^ using the solution to the power control optimization 
problem. 


= h{Y'^\E’^) - h{Y’^\X^,E^) 

n 

<YHYt\E^)-KYt\Xt,E^)] 

\ =Yl{Xt,Yt\E^), 

t=i 

where the inequality is due to the memorylessness of the 
channel. Applying this to ( [20l i gives: 

CtxRx = lim - sup 

1 " 

< liminf— sup 'S^ I{Xt,Yt\E*') 

1 " 

< liminf— sup 'Y I{Xt',Yt\E^)j (58) 
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where the last inequality is because Qn{b) C Q* (6). Note that 
we always have 


t=l 


I {Xu Yt\E^ = e‘) < ^ log (1 + E[Xf\E^ = e‘]) , 

since the mutual information of the scalar AWGN channel is 
always maximized by a Gaussian input distribution. Taking 
expectation, we obtain: 


I{XuYt\E*)<E 

Plugging this into (|58ll yields: 


C— < liminf 


-log{l + E[Xf\E^]) 


1 


sup E 


^-log(l + E[x 2|i?‘]) 


For a fixed Px'^we^ G Qni^) ^ denote 

5 t(e*) = E[X^\E* = e% t = Then g" e 

as given by Therefore, 


C'tx’rx^ < lim inf — max E 
n^oo n g"Ge°"““(6) 


1 

^-iog(i + p,(T;*)) 


= liminf max 

n-)-00 pn^gonline(5) 

_ rponline 


n 

= ^P£;^(e-)^/(Xt;r*|T;‘ = e‘). 

t—1 


Observe that I{XuYt\E* = e*) is in fact the rate obtained 
for a scalar AWGN channel when the input distribution 
is We can therefore maximize over all such input 

distributions to obtain the following lower bound for (|20|: 


> limsup - 5 ] PE^en E CsmitMe^)) 


n—>-(x> ^ 

e 

= lim sup — E 

n—yoo ^ 


Y^CsmitME^)) 


(59) 


where 


Csmith(^) = max/(X; X + N), 


for N ^ JY{0, 1) independent of X. This is the capacity of 
the amplitude constrained scalar Gaussian channel, which was 
found in p5) (hence the notation Csmith)- Unfortunately it 
is not tractable, however, it can be lower bounded using the 
following lemma. 


This gives ( |30b| l and the upper bound in pT] ). The derivation 
of the upper bound in ( |3^ is similar, and is shown in 
Appendix 

B. Lower Bounds 

We derive here the lower bounds in ( |30| ) and ( [3T| l. The 
derivation of the lower bound in ( [32l l is similar, and is deferred 
to Appendix 

1) Energy Arrival Information at the Transmitter and the 
Receiver: We start with Fix p’" e p’" will 

determine an energy allocation policy for transmission, and at 
time t = 1,... ,n we transmit a symbol with a peak power 
constraint of gt{E*). More precisely, for every S G [0, B], fix 
a distribution with support [—V^, V^]. We construct an 
input distribution of the form 

n 

where Pxt\Et{xt\e*) = Since Xf < gt{E*) and 

p" is an admissible online power control policy, the energy 
constraints are satisfied completely. This is clearly suboptimal, 
since most likely for some t, Xf < gt{E*), therefore energy 
will be wasted. Still, Px^ys" G Qnib), and thus we can obtain 
a lower bound by computing the mutual information in ( |20| ) 
for Px^iiB"- Note that the Xfs are independent given P”. In 
fact, we have: 

n 

Px"^yr.|£;r,{x",2/"|e"') = Pxt|£;‘(xt|e‘)Py|x(?/t|x4), 

t=i 

therefore, 

n 

I{X^-Y^\E^) = Y,I{XuYt\E^) 

t=i 


Lemma 1. The capacity of the amplitude constrained scalar 
AWGN channel with noise variance 1 can be lower bounded 
as follows: 

1 1 / TTP \ 

Csmith(5') > 2 log(l + -S') - 2 log ( y) ■ (60) 

This is the same as Lemma 1 in 0. the proof of which 
relies on results from | |3^ . For completeness, we bring here a 
simple proof using the entropy power inequality. 

Proof: Let X be uniform on the interval [—v^, V^]- 
Then, using the entropy power inequality: 

I{X] X + N) = h{X + N)- h{N) 

> ^ log - h{N) 

= ^ log(4S' + 27re) - ^ log(27re) 

= llog(iY) (W 

>^log(l + P)-ilog(^). 


Plugging ( |60| into ( [59l ): 


C" > lim sup -E 

n—f<x> ^ 


log{l + g^{E^)) 


> lim inf fr (5") - log (. 

n—>-oo Z \ 2 / 



(62) 


Since this is true for any p" G we can take the 

maximum to obtain: 


PtxRx > liminf max .^(g^) 
n—>-oo 
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which gives the lower bound in pT] ). 

Applying similar arguments for obtain the 

LHS of ( [3^ . This is shown in Appendix [P] 

2) Energy Arrival Information at the Transmitter Only: 
We continue to the derivation of the lower bound on 
namely P0a[ l. Fix p" S We construct an input 

distribution Pjj^ S Vnib) that consists of independent strategy 
letters: 

n 

-Pc/"(m") = 

t=i 

Recall that each strategy letter is a function Ut ^ X. 
Therefore, it can also be viewed as a vector in X^^^ . For 
each element of the vector, corresponding to each realization 
of we will choose the same distribution as in the previous 
case, namely: 

C/,(e‘) ~ 

This will induce the same conditional distribution Px^\\E'^ on 
Xn — that was constructed previously. Additionally, 

since p” G this input distribution is admissible, i.e. 

-Pc/-* e Pn(b). 

We can lower bound I{U"; F”) as follows: 

7(17”; F”) = 7(17”; F”, E^) - I{U^-, 7;”|F”) 

= 7(17”; F”|7;”) - 7(17”;7;"|F”) 

> 7(77”; F”|7;”) - 77(71”), (63) 


where (i) is because 7/” is independent of 77”; and (ii) 
is because 77” = 17”(71”) and the Markov chain 17” — 
(77”, 77”) —F”. Since 77” is distributed according to the same 
-Px" l|£;n as before, we get 

F”) > 77 ( 5 ”) - 1 log (’IJ) - ^77(77”), 

which, after maximizing over 5 ” G ( 6 ) and taking 

n —/ 00 , gives: 

(^causal > J^online _ ^ 


It turns out, however, that this bound may be too loose. The 
term 77(77t) may become very large for different distributions 
of Ef, and is in fact unbounded for increasingly large alpha¬ 
bets S. Intuitively, this gap implies that the receiver must learn 
the entire sequence of energy arrivals 77” in order to know the 
codebook from which the transmitter chose the codeword. For 
example, the rate corresponding to (|63 can be achieved by 
communicating the sequence of realizations of 77” at the end 
of each block which will induce a rate penalty equal to the 
entropy rate of this process. However, this requirement can be 
made less strict by observing that our desired input distribution 
at each time depends only on 5 t(e‘) - a deterministic function 
of eP By introducing special structure into 5 ”, we can make 
its entropy per symbol -H{g^{E^)), which is the amount of 
information that needs to be sent to the receiver, much smaller 


than 77(77(). See Section V-C 


Fix an online power control policy 5 ” G ( 6 ). We 

wish to construct Pjj^ in such a manner that gt{e*) alone will 
determine 77^. This implies that for two different energy arrival 


realizations, say e* and e*, that satisfy 5 t(e*) = 5 t(e*), we wish 
to have t/t(e*) = Ut{e*) with probability 1 . 

Since Ut can be thought of as a vector of size \£\*, we wish 
to specify the joint distribution of this multivariate random 
variable. For that matter, dehne the set of all possible outcomes 
of the power control policy at time t: 

Gt = {9 ^ K-i-l 9 = St(e*), e* G £*}. 

This set dehnes a partition on the set 7*, in the sense that 
disjoint subsets of £* map to different 9 G Gt- More precisely, 
let 

A(s') = {e* e 7*1 5 t(e‘) = g}. 

Then At{g) for different p’s are disjoint and 7* = 

UgGetA(5)- 

We will construct Pu^ so that all the elements in each of 
these subsets will be equal with probability 1 , and independent 
of all other elements of Ut'. For any g G Gt, let Zg be a random 
variable such that 


Ut{e*) = Zg ,Ve* G 7lt(5) w.p. 1, 

7 piff] 

and Zg’s are independent for different p’s. 

Note that, by construction, knowledge of Ut and gt{E*) 
suffices to know 77^: 

77* = C/*(77*) = Ut{gt{E*)). 

Clearly, Pp. G P„(6) since p” G ei™^‘”®(5). 

We proceed to lower bound F”) for this distribution: 

I{U'^;Y^) = I{U'^-Y^,g^{E^)) - 7((7”;p”(P”)|F”) 

i 7([/”;F”|p”(P”)) - 7(f/”;p”(P”)|F”) 

> 7([/”;F”|p”(P”)) - H{g^{E^)) 

= 7(X”; F”|p”(P”)) - H{g^{E^)) 

7(77”; F”|P”) - H{g^{E")) (64) 

where (i) is because (7” is independent of g^{E^)-, (ii) is 
because 77” = [/”(p”(P”)) and the Markov chain [/” — 
(p”(P”), X”) —F”; and (iii) is due to the Markov chain P” — 
(p”(P”),X”) — F” and because p”(P”) is a deterministic 
function of P”. 

Now, observe that our distribution Pp^ on P” induces a 
distribution Px^\\E" on 77" which is identical to the one 
we constructed in the previous case: the joint distribution 
Px".can be factored as 

n 

Px".£;",(7"(a^”,e",u”) = W_PE{et)PutiYt)^{^t =Ut{A')}. 

t=i 

Summing over u„: 

Px.,/5..c/.-i(a:”,e”,u”-i) 

= (x”-\ e”-\ u”-i) 

^ Pi/(en)Pt/,i (t/n)l{^n — )} 
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X ^ ^’B(en)f’c/„(e")(u„(e”))l{a;„ = ■u„(e")} 

«„(e’*) 

Summing over Un-i, then m„_ 2, and so forth, we obtain; 

n 

t^l 

We can therefore apply ( [62l t to obtain 

-/([/"; y") > ,57(5") - i log (^) - -P(g"(P")). 
n 2 \ Z / n 

Finally, since g" was arbitrary we can maximize over all 
possible online policies: 


-/(17";r”)> max {^(5”) 
n g"Ger‘"'’(&) 


-iM'i)- 


Taking n ^ 00 and substituting in ([T^, we get (|30a|l. 


VII. Conclusion 

We studied the communication problem with an energy har¬ 
vesting transmitter over the AWGN channel. We characterized 
the information-theoretic capacity of this channel as an n-letter 
mutual information rate under various assumptions on the 
availability of energy arrival information. We also considered 
the power control problem for energy harvesting communica¬ 
tion that has been of interest in the recent communication the¬ 
ory literature and provided an approximately optimal solution 
for the online version of this problem. We then proceeded to 
connecting these two different formulations of the problem and 
showed that the information-theoretic capacity can be lower 
and upper bounded by the long-term average throughput, i.e. 
the solution of the power control problem. Putting these results 
together allowed us to approximate the information-theoretic 
capacity of the energy harvesting channel with a simple and 
insightful formula within a constant gap independent of system 
parameters. 

There are many interesting research directions one can 
pursue from here. One immediate question is whether the 
approximation results in this paper can be significantly tight¬ 
ened to obtain better approximations for the capacity. One 
can also seek purely multiplicative approximations instead 
of the additive approximations we derived in this paper. 
Another interesting direction is to develop similar insights and 
results for energy harvesting processes with memory or certain 
correlation structure over time. Finally, the approximation 
approach developed in this paper can be used to understand 
the information-theoretic capacity as well as optimal online 
power control for various multi-user settings. 


Appendix A 

Capacity and Maximum Throughput Do Not 
Depend on Initial Battery State 

A. Information-Theoretic Capacity Does Not Depend on Ini¬ 
tial Battery State 

We prove Proposition [T] namely that the capacity does not 
depend on the initial state of the battery Bq. Let C be the 
capacity of the energy harvesting channel when Bq is some 
arbitrary value in [0, B] unknown to the receiver, and let Cg 
be the capacity when Bq = B. We show that C = C§. 

It is immediate that Cg > C, since any achievable scheme 
for any value of Bq can be achieved when Bq = Bhy ignoring 
the remaining energy in the battery. To show Cg < C, we 
show that any achievable scheme designed for Bq = B can 
be achieved in a system with arbitrary Bq. 

We do so by transmitting a large number of zeros, therefore 
recharging the battery to B, followed by the scheme designed 
for Bq = B. More precisely, we transmit £ zeros followed by 
an (M, n, e/2) code for Bq = B. Denote by £i the event that 
the battery is not charged to B, and by £2 the event that the 
code for Bq = B will produce an error. 

First, we have 

Pr{£'i} = Pr{Bo + ^^^t <5} 

I 

<PrI^Et<B} 

t=i 

where lim^_).oo = 0, and the last inequality follows from 
the law of large numbers, using the fact that E[£'(] > 0. We 
can choose £ large enough so that Pr{£i} < e/2. Note that 
this value of I depends solely on B and the statistics of Ei, 
and does not depend on the actual value of Bq. 

Next, from the i.i.d. nature of Et, we have Pr{f2|'?i} < 
e/2. Therefore, the total probability of error for our scheme 
Prl^i U £2} does not exceed e. The transmission spans £-\-n 
channel uses, thus we have constructed an (M, n-\- £,e) code 
for the channel with arbitrary Bq. By taking sufficiently large 
n, we can get a rate as close to Cg as desired. ■ 

B. Throughput Does Not Depend on Initial Battery State 

We state and prove the following proposition: 

Proposition 3. The long-term average throughput does not 
depend on the initial battery state, i.e. for any hi, 62 G [0, B]: 

liminf max = liminf max 

ra-»-oo g'^eg„{bi) n-J-oo g"Gp„(62) 

for offline and online policies alike. 

This immediately implies that we can compute the through¬ 
put or for any initial battery level, say B, 

regardless of the actual battery level of interest 69. 

Proof: We will give the proof for online policies, however 
it transfers immediately to offline policies. Denote 

(6) = liminf max .^( 0 "). 

n—)-oo g"G5°"'‘"®(b) 
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We will show that for any 0 < 6 < S, 

which will imply the desired result. 

First, clearly since C 

for 0 < 6 < S. To show the other direction, let 
{ 5 "}^! be the sequence of maximal policies in 
that is: 


p" = argmax ^( 5 '") , n = 1, 2,... 

s"ee„(B) 


Fix i > 1. For any n > £, consider the following online power 
control policy p" for initial battery level b: Transmit zeros 
igt = 0) for the first £ time slots. This will allow the battery to 
completely recharge to B with high probability. Then, if B( = 
B, transmit the policy Otherwise, transmit zeros for 

n — £ time slots (i.e. give up on the entire transmission). More 
precisely, define the new policy as follows, for f = 1,..., n: 


0 ,1 < f ^ 

^{Bi=B} ' ,£+l<t 


where Ipj is the indicator function. Observe that B^ is a 
deterministic function of e^, which is given by Bi = min {6 + 
St=i 6t, .B}. We have for any n > £: 


w) = -E® 

-n 


log {1 + gt{E*)) 


1 ” n 

= - X! ® 9 log (1 + 1 {B£=.b} • 


n ^—' 2 

t=e+i 


-“X!® = • 9log(l+5i(B^+0) 


n—i 


-| /fc t -| 

^ ^ Pr{i3, = 5} . E - log(l + 9t{E^+l)) 


1 Pr{B^ = B}- 


1 


ilog(l + gt(B*)) 


= Fi{Be = B} 


n — - 


nr 


where (i) is because Bg depends only on E^, and Et is 
independent over time; and (ii) is because Et is i.i.d. 

Note that p” € for any 0 < b < B. Therefore: 


yonline(^) > inf 
n—>-oo 

= PT{Bi = B} ■ lim inf , 3 ^( 5 "-^) 

n—>-oo Tl 

= Pr{Bf = (65) 


We can lower-bound the probability of recharging the battery 
using Chebyshev’s inequality: 


Pr{B^ = B} = I -Vr {b + ^Et < B] 
{£ ■ E[Et] - B + 6)2 

2 ^ 1 

= 1 - Q, 


where 0 as ^ 00 . Substituting this in ( |65| ) yields 

r™'“®( 6 ) > (1 - e^) • r™'“®(B). Since the LHS does not 
depend on £, we can take £ —00 to obtain > 

T°niine( 5 )^ which concludes the proof. ■ 


Appendix B 

Capacity with Noncausal Side Information 
We prove Theorem for the case of energy arrival infor¬ 
mation available noncausally at the receiver and the trans¬ 
mitter Recall the definition of En{b) in ( [Thl l and fix 
Bx"|B" G En{b). We transmit k blocks of length n + £ + 1 
as in Section |IV] Generate a random codebook for every 

gnfc g gnk 


by generating k independently drawn codewords 
from Bx-|B"- Specifically, denoting 
we choose Xt(et) ^ Bxr>|£;n(-|ei) and transmit 


X 


i(n+e+l) 


^(i-l)(n+£+l) + l ~ (n-|-f-|-l)-|-n-|-l j 0]) 

where Zi = 1 if 6 (i_i)(„+f_|_i) > b and = 0 otherwise. 


and 0 is a length-^ vector of zeros. Similarly to Section IV 


the energy constraint will be satisfied. From here on, we 


repeat the arguments of Section IV The receiver makes use of 
y^ = 2/[-Zij(n+£+i)+r e,, * = 1 ,..., A:, for decoding. The 

channel is memoryless with i.i.d. side information available 
at both the receiver and the transmitter. Note that the i.i.d. 
Bernoulli RV Zi is independent of the side information e^. 
Therefore we obtain 


C noncausal 
TxRx 


> lim sup — 

n—>-oo R Py. 


sup ( 66 ) 

|E-GJr-„(b) 


Conversely, from Fano’s inequality we have 


< liminf- sup I{X^;Y^\E'^), 

n-*-oo n 


which, combined with (| 66 |), gives (| 2 T]i. 


Appendix C 

Alternative Representation of Energy 
Constraints 

In this section, we derive the alternative representation of 


the energy constraints stated in the beginning of Section VTA 


Suppose a;" and e" satisfy constraints (|^ and (|^ for t = 
1,... ,n and Bq = 6 , that is 

xi<bt, (67) 

6t = min{6t_i - a;t_i -f et,B}, (68) 

for t = 1,..., n, where 6 o = 6 . We show that this is equivalent 
to satisfying 


Y.^]<b+ y. 


t 


j=i+l 

t 


& + 

i=i i=i 

for t = 1,... ,n. 

Suppose x”, e" satisfy (|67]i and 


= (69) 

(70) 

. For any t = 1,..., n: 


Xt < B, 


(71) 
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Xt < bt-1 - xl_^ + et, (72) 

where ( fTT) gives ( [69] t for z = t, and ( |72| ) can be further written 
as 


x‘l_^ + < B + et, (73) 

xl_i + xl < bt-2 — Xt_2 + et-1 + et- (74) 

<0 gives ( |6^ for i = t — 1, and ( |74l l can be written as 

Xt_2 + Xt_i +Xt < min{6t_3 ~ Xt_2, + et-2, B} + Ct-i + et- 

Continuing in this fashion gives ( |69l l for alH < f and ( |70| . 

Now, let a:”,e" satisfy ( |69l l and ( |70l i for t = 

Applying ( |68l l for bi,... ,bt, we can express the battery state 
at time t as follows: 

t-i t t-i t 

bt = xmn\b-Y,xl+Y.^^, II ®^}}- 

2=1 2=1 j—i 

Hence, ( |67| ) holds if and only if; 

t-i t 

a;? < 7? - H = 

j=i j=i+l 

Xt <b-'^Xi + '^e^, 
i=l i=l 

for t = 1,... ,n, which is exactly (|69ll and (fTOli. 


Appendix D 

Noncausal Capacity Bounds 

We derive the bounds on the capacity with noncausal 
observations of the energy arrivals at the transmitter and the 
receiver, namely ( |3^ . We repeat the steps of Section [Vl] start¬ 
ing with the upper bound. Rewrite ( [Tb] ) as (see Appendix]^: 

J-nib) = ^Px^\E^ s.t. Ve" S £", a.s. for f = 1,..., n: 

t t 

'^X^<B+ ej = 

j=i j=i+l 

Yx^<b+Y^^}- 

i=i i=i 

Additionally, define 

Xnib) = ^Px^\E^ s.t. for f = 1,..., n and Ve" G 

t t 

Ye[X]\E^ = e^] < B + Y ea ,* = !,■ ■■,7, 

j=2 i^^+i 

t t 

Ym]\E'^ = n<b+Y^^]- 

i=i i=i 


Similarly, we can write (|24|l for the offline policies 


as 


s.t. Ve” e £” : 




5 Z - 1 , . . . , f, 


Esa(0<S+ y] e, 

j^i i-2+1 

t t 

HviCe”) < 6 + X^ej ,f = l, 
i=i i=i 

We now upper bound the expression in repeating 

the steps in Section VI-A First we upper bound the mutual 
information as: 




/(X”;y”|F;”) <YHXt;Yt\E^). 

t=l 

Next, we apply this inequality to ( [21) : 

C't"'®= lim - sup /(X”;r”|£;”) 

n-J-oo n 

1 " 

<liminf- sup Y I{Xt;Yt\E^) 

1 " 

<liminf- sup Y I{Xt-,Yt\E'’^). 


For each e” G £”, we have: 

I{Xt;Yt\E^ = e”) < ^ log (l + E[X2|F;" = e”]), 

and thus 


C noncausal 
TxRx 


< lim inf — sup E 
n->oo n Px,.|e-GJ^;(&) 


1 

y]-log(l+E[x2|i:;"]) 

.t=i ^ 


< lim inf — max E 

ra->oo n g"G5°™‘“(6) 

_ rTToffline 


1 

Y^\og{l + gt{E-)) 


For the lower bound, fix p” G 5°®'”® (5). Let 

n 

Px"|P"(T"|e") = P[Fxt|p"(a:(|e”), 
t=i 

where Px^iE^ixtle"^) = and is a distribu¬ 

tion with support [—•x/S', v^]- Clearly Px"|P" G Pn{b), and 


t=i 


Then, after maximizing over all such distributions, we get 
I{Xt\Yt\E^ = e”) = C'smith(5t(e”)), and therefore: 


> lim sup -E 

n—>-oo R 


> lim inf — E 

n—>-oo 77, 


C'sniith(yi(e'")) 

Yllog{l+gtiE^)) 


(75) 



where the second inequality is due to Lemma [T] Since this 
is true for any g” G 5°®‘”®(6), we can take the maximum to 
obtain ( [3^ . 


Observe that En{b) C 
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Appendix E 

Multiplicative Bounds; Prooe oe Theorem[5] 
Recall the proof of the lower bound in Section |VI-B| We 
will continue from equation ( |59l l. We will develop here a dif¬ 
ferent lower bound for C'smith(*S'), specifically a multiplicative 
lower bound. In what follows, we will show that 

C's„,ith(^) > ^ log(l + 5), 

for an appropriate p and all S' > 0. Substituting this in ( [59l l, 
( |64| l, and ( |75] l, immediately yields equations (|S4|l-(|3^. 

First, one can obtain from ( |M] l in the proof of Lemma [T] 
the following lower bound; 

Cs„ith(g) ^ + ^ 2 

ilog(l + S)- ilog(l + S) -Tre’ 

which implies p > ^ = 0.2342. However, it can be observed 
numerically that 77 is larger that this value, and it is in fact 
77 = 0.7473. In what follows, we show > „ yj^ ^ 

numerical proof We divide IR_|_ into five regions, and show 
the inequality holds for all S in each region. 


A. 0 < S < 0.69 

Consider a binary input distribution, that is = ±-\/S w.p. 
1/2. Denote 

Cbi„(S) ^ /(X; X + N)= I{Z- VSZ + N), 


where Z = ±1 w.p. 1/2. Then 

C'smith(^) ^ Cbm (5) 


5 log(l + S) 


> 


> 


CUS) 


S) - 


i log(l 
ve 

0. This implies i?(S) = ^ 1, and 


By 137 Lemma 1], we have Cbin(S) = 2in2 

—>• 0 as S , — 1 g 

2 In 2 

along with C'sniith(5’) < ^ log(l + S) we conclude 


o(S), where 


Cs„.ith(S) 


1 when S —> 0. 


i log(l + S) 

In fact, this was already observed by Shannon in his 1948 
paper p8|. 

1 r ^ 

Now, we will show that R{S) = '’1 / is non-increasing. 

By 137 Corollary 1], the function Cbm (S') is concave, implying 
that the derivative is non-increasing. By the mean 

value theorem, for every S' > 0 there is some 0 < c < S 
such that 

Cbm(5) - Cbm(O) Cbm(S) 

s ■ 


cLis) < cUc) = 


S-0 

Next, we take the derivative of R{S): 

21n2 _ Cbm(S) \ 


i?'(S) = 


S 


cLis) 


s J 


< 0 , 


observing that R{S) is monotonic non-increasing. 

We compute R{S) for S = 0.69 numerically to obtain 
77(0.69) = 0.7501. Since R{S) is non-increasing and i?(0) = 
1, this implies 


CsmithiS) 


iiog(i + s) 
for 0 < S < 0.69. 


> i?(S) > 0.75 


B. 0.69 < S < 170 

In this part, we compute Csmith(S) for a finite set of points 
{SJdi in [0.5,170], where 0.5 = Si < S2 < ... < Sat = 
170. For every Si < S < Si+i, 7 = 1,..., X — 1; 

Csmith(S) ^ Csmith(Si) 
ilog(l + S) - ilog(l + S,+i)' 


Computing this lower bound for any such set of points and 
taking the minimal value will give a lower bound for all S G 
[0.5,170]. We compute this numerically using the algorithm 
suggested in p5] and obtain 


Csmith(S) 

ilog(l + S) 


> 0.7473, 


0.5 < S < 170. 


C. 170 < S < 195 
For all 170 < S < 195, we have 

Csn,ith(S) ^ Csn,ith(170) ^ Csn,ith(170) 

5 log(l -f S) “ i log(l -f S) “ 5 log(l -f 195) 


D. 195 < S < 340 

Let Cunif(S) = I{X;X + N) where X - U[-VS,VS]. 
As before, we compute Cunif(S) numerically for a set of 
points {Sj}^i, where 195 = Si < ... < Sm = 340. This 
involves numerical integration of the form f f{y) log f{y)dy, 
where f{y) = :^{Q{y - VS) - Q{y + \/S)) and Q{x) = 

XT For every S^ < S < S^+i, j = 

1 ,...,M- 1 ; 

Csn.ith(S) ^ Cunif(S,) 
ilog(l + S) - ilog(l + S,+i)- 


Computing this lower bound numerically for the set of points 
Sj = 195 + (j — 1)0.5, j = 1,..., 291, and taking the 
minimum, gives; 


Csn.ith(S) 

i l0g(l -f S) 


> 0.7482, 


195 < S < 340. 


E. 340 < S 

For S > 340, we use Lemma [T] to see that 

Csmith(S) ^ |l0 g(l-fS)- |l 0 g(f^) 
ilog'(l-fS)“ ilog(l-fS) 

iog(¥) 

log(l-f340) 

= 0.7511. 


Combining all the above bounds, we conclude that 

Cs„,ith(S) 


ilog(l + S) 


> 0.7473, 


for all S > 0. 




































22 



Fig. 6. Numerical evaluation of the ratio Csmith(<S’)/1 log(l + S ). 


Appendix F 

Upper Bound on Expected Epoch Length in 
Generalized Bernoulli Policy 


As shown in Section V-B2 the epoch length in the Gener¬ 
alized Bernoulli policy is given by 


L = {inf t : St> B[1 - (1 - g)*]}, 

where St = ~ mm{Ei, B}. Consider the 

following RV: 

L — {inf t : St > B}. 

Clearly L < L, therefore: 


EL < EL 

oo 

= ^Pr(Z>£) 

OO 

= Y,P^(Si<B,S2<B,...,Si.i<B) 

OO 

= ^Pr(5,_i <5), (76) 

i=i 


where the last equality is due to the non-negativity of the 
energy arrivals Et > 0. Using Chernoff bound, we upper 
bound the probability in the sum, for any 0 > 0: 

Pr(S'f_i <B)< 

Substituting the in ( |7^ , we have an infinite series of the form: 

OO 

EL < ^e®^(E[e-®^‘])^”\ 

1=1 


Now, since Et > 0, we must have < 1 w.p. 1. This 

implies that E[e“®^‘] = 1 if and only if = 0 w.p. 1. 
However, since Et > 0 with positive probability, we must 
have E[e“®®‘] < 1. Hence the series must converge to a finite 
number and EL < oo. 
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